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We present measurements of the spatial clustering of galaxies with stellar 
masses > 10^^ Mq, infrared luminosities > IO^^Lq, and star formation rates > 
2OOM0yr~^ in two redshift intervals; 1.5 < z < 2.0 and 2 < z < 3. Both samples 
cluster very strongly, with spatial correlation lengths of Tq = 14.40 ± 1.99/i^^Mpc 
for the 2 < z < 3 sample, and tq = 9.40 ± 2.24/i~^Mpc for the 1.5 < z < 
2.0 sample. These clustering amplitudes are consistent with both populations 
residing in dark matter haloes with masses of ~ 6 x IO^^Mq, making them among 
the most biased galaxies at these epochs. We infer, from this and previous results, 
that a minimum dark matter halo mass is an important factor for all forms of 
luminous, obscured activity in galaxies at 2; > 1, both starbursts and AGN. 
Adopting plausible models for the growth of DM haloes with redshift, then the 
haloes hosting the 2 < z < 3 sample will likely host the richest clusters of 
galaxies at z=0, whereas the haloes hosting the 1.5 < 2; < 2.0 sample will likely 
host poor to rich clusters at z=0. We conclude that ULlRGs at 2; > 1 signpost 
stellar buildup in galaxies that will reside in clusters a.t z = 0, with ULlRGs at 
increasing redshifts signposting the buildup of stars in galaxies that will reside 
in increasingly rich clusters. 

Subject headings: galaxies: active — galaxies: starburst 

1. Introduction 

Among the core principles of current theories for the formation of large-scale structure is 
the premise that the evolution of the total mass distribution can be described by the evolution 
of Gaussian primordial density fluctuations, and that this evolution is traced by galaxies in 
some determinable way. Overdense 'halos' in the dark matter distribution are predicted 
to undergo successive mergers over time to build haloes of increasing mass, with galaxies 
forming from the baryonic matter in these haloes. This framework of 'biased' hierarchical 
buildup (Gole et al 2000; Granato et al 2000; Hatton et al 2003) has proven to be remarkably 
adept in explaining many aspects of galaxy and large-scale structure formation. 

Gomparing these predictions with observations is an active topic, and is particularly 
controversial concerning the evolution of massive galaxies (those with masses > 10^^ Mq). 
The 'naive' expectation might be that massive galaxies form over a long period of time, as 
many halo mergers are needed to build up large baryon reservoirs, and indeed some massive 
galaxies do appear to form in this way (van Dokkum et al 1999; Tamura & Ohta 2003; Bell 
et al 2004). There is however observational evidence that many massive galaxies may form 
at high redshift and on short timescales (Dunlop et al 1996; EUis et al 1997; Blakeslee et 
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al 2003), implying that the stars in massive local galaxies formed in less than a few Gyr of 
each other at z > 1. Further evidence has come from blank-field sub-mm surveys (Hughes 
et al 1998; Barger et al 1998; Bales et al 2000; Scott et al 2002; Borys et al 2003; Mortier et 
al. 2005), which have found a huge population of sub-mm bright sources (SMGs) at z > 1, 
with IR luminosities of > 1O^^L0. SMGs are far more numerous than sources in the local 
Universe with comparable IR luminosities, the so-called Ultraluminous Infrared Galaxies 
(ULIRGs). These distant ULIRGs are good candidates for massive galaxies caught in the 
act of formation; they have a median redshift of 2; ~ 2.5 (Chapman et al 2005), high enough 
star formation rates to satisfy the observed color-scaling relationships in massive evolved 
systems, and a comparable comoving number density to local massive ellipticals. SMGs 
however pose a challenge for structure formation models, which invoke a diverse variety of 
solutions to explain the observed sub-mm counts (van Kampen et al 1999; Granato et al 
2004; Baugh et al 2005; Bower et al 2006). 

Progress on these issues requires observations that can be compared to models in a 
meaningful way, and one of the best ways is via measurement of clustering amplitudes. 
Massive haloes are predicted to cluster together on the sky, with the strength of clustering 
depending on their mass (Kaiser 1984; Bardeen et al 1986). If distant ULIRGs are the 
formation events of massive galaxies, then they should trace this clustering. Measuring 
clustering amplitudes for distant ULIRGs is however difficult; it requires a minimum of 
several hundred sources found over a significant area of sky (e.g. van Kampen et al 1999), 
but IR observatories available up to now cannot map large enough areas to the required 
depths to find this number of sources at 2; > 1. 

The launch of the Spitzer Space Telescope (Werner et al 2004) offers the potential to 
overcome these problems, due to its ability to map large areas of sky in the infrared to 
greater depths than any previous observatory. In this letter, we study the clustering of 
starburst-dominated ULIRGs in two redshift bins centered at 2; ~ 1.7 and z ~ 2.5. We 
assume Hq — 70 km s~^ Mpc~^, Q — 1, and JIa = 0.7. 

2. Analysis 

To select high redshift, IR-luminous star-forming galaxies, we use the l.d/im emission 
feature, which arises due to photospheric emission from evolved stars. When this feature is 
redshifted into one of the IRAC channels then that channel exhibits a peak, or 'bump' com- 
pared to the other three channels (Simpson & Eisenhardt 1999; Sawicki 2002). A complete 
discussion of the source selection and characterization methods is given in Lonsdale et al 
2006 (in preparation), which we summarize here. 
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Our sources are taken from three fields observed as part of the Spitzer Wide Area 
Infrared Extragalactic Survey (SWIRE, Lonsdale ct al 2003); the ELAIS Nl and ELAIS N2 
fields, and the Lockman Hole, covering 20.9 square degrees in total. Wc first selected those 
sources fainter than R=22 (Vega), and brighter than 400/iJy at 24fim. Within this set, we 
selected two samples that displayed a 'bump' in the 4.5/im and 5.8/xm channels, i.e. where 
/3.6 < /4.5 > /5.8 > /s.o for ouc Sample (the 'B2' sample) and where /a.e < /4.5 < /s.s > /s.o 
for the other sample (the 'B3' sample). This resulted in a total of 1689 B2 sources and 1223 
B3 sources. 

For both samples we used the Hyper-Z code (Bolzonella et al 2000) to estimate redshifts, 
the results from which place most of the B2 sources within 1.5 < z < 2.0, and most of the B3 
sources within 2.2 < 2; < 2.8. Prom the best fits we also derived IR luminosities and power 
sources, both of which are uncertain due to the photometric nature of the redshifts. The 
requirement that the sources have /24 > 400//Jy however demands an IR luminosity of > 
IO^^Lq for all the sources. In most cases the SED fits predict that the dominant power source 
is a starburst, with star formation rates of > 2OOM0yr^^ (assuming a standard Salpeter 
IMF). Similarly, the presence of the 1.6/im feature demands (accounting for uncertainties in 
redshift and stellar models) a minimum mass of evolved stars of ~ W^Mq. Both the B2 
and B3 sources are thus good candidates for being moderately massive galaxies harboring 
an intense, obscured starburst, making them similar in nature to both local ULIRGs, and 
high-redshift SMGs. 

We measure the angular clustering of both samples using the standard parametrization 
for the probability, p, of finding a source in a solid angle Qi, and another object in another 
solid angle fl2 separated by an angle 9: 

p{e) = N^[i + uj{e)]nin2, (i) 

where N is the mean surface density of objects and u!{9) is the angular two-point correlation 
function. We compute u;{9) using the Landy & Szalay (1993) prescription, and divide u;{9) 
by the "integral constraint" (Infante 1997) to correct for the finite size of the sample (though 
in both cases this correction was negligible). The survey area geometries are accounted for in 
the analysis by constructing coverage masks of the joint MIPS 24/xm and IRAC observations, 
and only considering those sources that lie within these masks. 

We found that the levels of angular clustering seen in the three fields were consistent 
with each other to within 0.5a, with all three fields showing positive clustering for both the 
B2 and B3 samples, We therefore combined the angular clustering measures for each sample 
over the three fields. In Figure 1 we present the combined a; (6*) (with Poisson errors) vs. 9 
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plots for both samples. Both samples show a statistically significant detection of positive 
clustering on length scales of < 0.1°. To quantify the strength of clustering, we fit both 
datasets with a single power law: 



u;{9) = AJ'-^ (2) 

where is the clustering amplitude. A wide variety of sources arc well fitted with 7 = 1.8 
on small length scales (Bahcall et al 1983; Ovcrzicr ct al 2003; Wilson 2003). Wc first check 
that our two datasets can be fit this way by allowing 7 to vary and find that 7 = 1.8 is 
within la of the best-fit values. We therefore fix 7 = 1.8 and fit to get = 0.0125 ±0.0017 
for the B3s and A^ = 0.0046 ± 0.0011 for the B2s. 



3. Discussion 



If a sample of galaxies with a measured uj{6) also have a constrained rcdshift distribution, 
then we can derive the spatial correlation length, ro[z), of the sample by inverting Limbers 
equation (Limber 1953). For a redshift range spanning a to b, this can be expressed as: 



^o(^) 

m 



H-'A^cC 



J a dz 



Jl{^fE{z)Dl-\z)f{z){l + z)dz 



(3) 



where f{z) parametrizes the redshift evolution of Tq, D^lz) is the angular diameter distance, 
c is the speed of light, dN/dz is the sample redshift distribution, and: 



^ - r(i/2[r(if- 11/2) -^'-'-'"-"'^ + + « 

To compute tq from Equation 3 requires a redshift distribution, dN/dz, but we have no 
spectroscopic redshifts for our sample. Therefore, we use analytic forms for dN/dz derived 
from the photometric redshift distributions (Lonsdale et al 2006). For the B2 sources this 
is a Gaussian centered at z = 1.7 with a FWHM of 1.0, and for the B3 sources this is a 
Gaussian centered at z = 2.5 with a FWHM of 1.2. The resulting correlation lengths are 
To = 9.4±2.24/i^-'^Mpc for the B2 sources, and Tq = 14.4±1.99/i~"'^Mpc for the B3 sources. As 
a check, we derived both redshift distributions using an independent code (Rowan- Robinson 
et al 2005), and found that the mean redshifts were the same, but that the FWHMs were 
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in both cases slightly broader. Although, in general, broader redshift distributions result 
in larger clustering amplitudes, in our case the effects are insignificant; for example if we 
use the Rowan- Robinson et al (2005) redshift distribution for the B3 sources then we obtain 
tq = 16.5 ± 2.30/i~^Mpc. To be conservative, we use the Lonsdale et al 2006 redshift 
distributions. 

To place these chistcring results in context, we consider two classes of model. The 
first parametrizes the spatial correlation function, ^, as a single power law in comoving 
coordinates: 



where Tc is the comoving distance, and Tq is the comoving correlation length at 2; = 0. The 
comoving correlation length at a redshift z, ro{z), can therefore be expressed as: 



where the choice of e determines the redshift evolution (Phillipps et al. 1978; Overzier et al 
2003). Generally, models that use this parametrization are those where the correlation length 
increases or remains constant with decreasing redshift; these can be thought of as tracing 
the clustering evolution of a given halo. Several cases are usually quoted. First is 'comoving 
clustering', where haloes expand with the Universe, and e = 7 — 3; in this case clustering 
remains constant with redshift. Second is the family of models for which e > 0, for which 
clustering increases with time. Examples of this family include (a) 'stable' clustering, for 
which e ~ (in this case the size of the haloes is frozen in proper coordinates, hence comoving 
expansion of the background mass distribution makes the halo clustering grow stronger), (b) 
the predicted evolution of clustering of the overall dark matter distribution, where e ~ 7 — 1 
(Carlberg et al 2000), and Tq ^ 5 at z=0 (Jenkins et al 1998), (c) 'linear' clustering, where 
e = 1.0. A cautionary note to this is that detailed interpretations of clustering evolution from 
these models suffer from several theoretical flaws (Moscardini et al 1998; Smith et al. 2003), 
and so should be thought of as qualitative indicators rather than quantitative predictions. 
We therefore simply use the 'stable' and 'hnear' models as indicators of the possible range 
of halo clustering amplitude with redshift. 

The second class of model comprises those in which comoving correlation lengths increase 
with increasing redshift. These models introduce 'bias', b{z), between the galaxies and the 
underlying dark matter. An example of such models are the 'fixed mass' models (Matarrese 
et al 1997; Moscardini et al 1998), which assume that galaxies selected in a certain way 
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predominantly occupy haloes of the same mass. These models therefore predict the clustering 

strength of haloes of a specified mass at any given redshift. In Figure 2 we plot the e model 
for dark matter, 'stable'and 'linear' epsilon models normalized to the B2 and B3 clustering 
strengths, the fixed halo mass models for halo masses of IO-'^^Mq, IO^^Mq, and IO-'^^Mq, the 
tq values for the B2 and B3 galaxies, and the spatial correlation lengths of other galaxy 
populations taken from the literature. 

Considering these uncertainties, care must be taken when comparing the models to 
observed galaxy correlation lengths. With this in mind, we use Figure 2 to explore the rela- 
tionships between our samples, the underlying dark matter, and other galaxies. Considering 
first other measured or predicted correlation lengths at the same redshift, then both samples 
are very strongly clustered, with correlation lengths much higher than that predicted for the 
overall DM distribution at their respective epochs. Both B2s and B3s cluster significantly 
more strongly than optical QSOs at their respective epochs, and B3s cluster more strongly 
than sub-mm selected galaxies. Based on the Matarrese et al (1997) models, then we derive 
approximate la halo mass ranges of 10^^'^ < 10^^'^ for the B3s, and 10^^'^ < 10^^'^ 

for the B2s. 

The most interesting comparison is however between the two samples themselves. The 
clustering evolution of QSOs with redshift (Croom et al. 2005) may mean that there is 
a 'minimum' host halo mass for QSO activity, below which no QSO is seen, of ~ 5 x 
IO^^Mq. The correlation lengths for the B2 and B3 samples are consistent with the same 
conclusion but for a higher halo mass, tracing the clustering line for a ~ 6 x lO^^M© DM halo. 
Interestingly, from Figure 2 we would draw a similar conclusions for optically faint LBGs, 
albeit for a halo mass of ~ 10^"^ Mq (see also Adelberger et al. (2005)). Taken together, these 
results imply that a minimum halo mass is an important threshold factor for all forms of very 
luminous activity in galaxies, both starbursts and AGN. It is also interesting to speculate 
on what the host haloes of B2 and B3 sources contain at lower and higher redshifts. We 
might expect that a halo hosting a B3 source could contain an optically bright LBG at 
^ ~ 4 when its mass is ~ lO^^'^M©, followed by a B3 at 2; ~ 2.5 once the halo has reached 
~ IO^^Mq, possibly accompanied by other (near-IR selected) star forming systems (Daddi et 
al. 2004, 2005), before evolving to host an extremely rich galaxy cluster at low redshifts, with 
a halo mass substantially exceeding IO^^Mq. The occupants of a halo hosting a B2 galaxy 
would however probably be different. We would expect that such a halo could contain an 
SMG at 2; ~ 2.5, and optically fainter LBGs at 4 < 2; < 5 (though probably not LBGs 
at 2; ~ 3). At lower redshifts such a halo might host a radio-bright AGN and or ERO at 
2; ~ 1, and a (poor to rich) cluster at 2; = 0. These evolutionary paths for the B2s and 
B3s are consistent with semi-analytic models and numerical simulations (Governato et al. 
1998; Nagamine 2002) which predict that LBGs at high redshift evolve into local clusters. 



-8- 



We conclude that ULIRGs at z > 1.5 as a class likely signpost stellar buildup in galaxies 
in clusters at ^ = 0, with higher redshift ULIRGs signposting stellar buildup in galaxies 
that will reside in more massive clusters at lower redshifts. These predictions are however 
sensitive to the assumed form of DM halo growth, and should be regarded with caution. 

We thank Emanuele Daddi for saving our lives, Andy Connolly for useful discussion, and 
the referee for a very helpful report. SJO acknowledges a Lcverhulme Fellowship. Support 
for this work, part of the Spitzer Space Telescope Legacy Science Program, was provided 
by NASA through an award issued by JPL under NASA contract 1407. This research was 
carried out, in part, by JPL, California Institute of Technology, and was sponsored by NASA. 
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Fig. 1. — The two-point correlation function and (Poisson) error bars as a function of 9 for 
(left) the B2 sample, and (right) the B3 sample, in 4' bins. The three hnes show the best-fit 
power law, and the Icr limits. 
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Fig. 2. — Comoving correlation length, tq, vs. redsliift. Other data are taken from Moscar- 
dini et al 1998; Overzier et al 2003; Daddi et al. 2004; Blain et al 2004; Ouchi et al. 2004; 
Adelberger et al. 2005; Groom et al. 2005; Georgakakis et al. 2005; Allen et al. 2005. The 
'Fixed mass' lines show the predicted clustering amplitude of haloes of a given mass at any 
particular redshift, whereas the e lines show the predicted clustering amplitude of an individ- 
ual halo for three halo growth models, described in the text. The 'Stable' and 'Linear' lines 
give a qualitative indicator of the range of how DM haloes may grow with redshift, and we 
have normalized 'Stable' and 'Linear' lines to the clustering amplitudes of the B2s and the 
B3's. The shaded regions therefore indicate what these haloes may host at lower and higher 
redshifts. - the haloes hosting B3s may contain an optically bright LBG at z ~ 4 (upper 
green point), and grow to host very rich galaxy clusters at z=0, whereas the haloes hosting 
B2 sources may contain optically fainter LBGs at 4 < z < 5, SMGs at z ~ 2.5, radio-bright 
AGN (upper pink triangle) and (old) EROs at z ~ 1, and poor to rich clusters at z=0. 



